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PETECTION of EPIGENTIC ABNOBMAtlTIES aad DIAGNOSTIC 
METHOD BASED THEREON 



5 The present invention relates to identification of epigeneiic abnonnalities. More 

particularly^ the present invention relates to diagnosis of diseases based on DNA 
methyJation differences, and identification and isolation of genes that cause such 
diseases. 

10 BACKGROUND OF THE INVENTION 

Substantial progress has been made in recent years with respect to ttie diagnosis 
and treatment of diseases in which a single defective gene is responsible. Traditional 
lintcage studies have effectively isolated the causal gene and allowed for the fiitther 

15 development of diagnostic tests and furdiered research into treatments such as gene 
iher^y for conditions such as cystic fibrosis, Duchennes muscular dystrophy, 
Huntington's disease and fragile X syndrome. However, similar progress has not been 
made in diseases caused by mutations in multiple genes. Traditional linkage studies in 
complex diseases sue* as schizophrenia, bipolar disorder, cancers and diabetes have only 

20 succeeded in isolating chromosome regions, often conlaimng200-300 genes. The ability 
to screen such a large number of genes is clearly a time-consuming and daunting task. 

Epigenetic mechanisms can be an important factor in complex, mulii-factorial diseases 
such as cancers. Epigenetics refers to modifications in gene expreaion that are brou^t 

25 abottibyheriiable.bmpotcniiallyreversiblcchanges inDNAmethylationandchromatin 
strociurc (HenifcofT S. Mat2ke MA Exploring and explaining epigenetic effects. Trends 
Genet l997,13(8):293-5; Siegfried Z, Eden S, Mendelsohn M, Feng X, Tsubert BZ. 
Cedar H. DNA mefhylation represses transcription in vivo. Nat Genet 1999, 
22(2):2a3-206;Gon2algo,M.L.andJottes..P.A,(l997)Mmagenicandepi^ 

30 of DNA methylation. Matal. Res. 386(2), 107-18; Razin, A. and Shemer, R. (1999) 
Epigenetic control of gene expression. Resuhs Probl. Cell. Differ. 25, 189-204; Lyko, F. 



JIffl-OSnZOOZ I3:SZ ^^l.ngs Fax 3 

■l Kit 



-2 



and Paro, R. (1 999) Chromosomal elements confening epig^etic iiib^tance,Bioessays 
21(10), 824-32). DMA meAylwon of the bindms sites for iFanscripuon factors changes 
the affinity of such fhciors for regulatory sequences, which affects the nranscriptional 
activity of a gene (Ehrlich M and Ehrlicb K (1993) Effect of DNA methylation and the 

S binding of vertebrate and plant proteins to DNA. In: Jost JP and Saluz P (eds) DNA 
Methylation: Molecular Biology and Biological Significance pp. 145-168. Birkhauser 
Verlag, Basel. Switzerland; Riggs A, Xiong Z, Wang U and LeBon JM (1998) 
Methylation dynamics, epigenetic fidelity and X chromosome structure. In: Wolffe AP 
(ed) Epigenetics, pp. 214-227. John Wiley & Sons, Chistester). In addition to positional 

10 effects of methylated cyiosines> density in a gene regulatory region also coniributes to 
gene activity. This type of regulation is mediatedby methylated cyiosine binding proteins 
and acetylation of histones ( Jones PU Veenstra GJ, Wade PA, Verma^ D» Kass SU, 
Landsberger N, Strottboidis J, and Wolffe AP (1998) Meth^dated DNA and MeCP2 
recniit histone deacetylase to repress transenption- Nature Genetics 19: 1 87-91 ; Nan X» 

15 Ng HH. Johnson CA, Laheity CD, Turner BM. Eisenraan RN, and Bird A (1998). 
Tianscriptiona] rqiression by the methyl-CpG-bindingpiotein MeGP2 involves a histone 
deacetylase complex. Nature 393; 38fi-9; Robertson KP and WoI£fe AP (2000) DNA 
methylation in health and cUsease. Nature Review Genet 1:11-9). 

20 Methylation can occur within cytosinc-guanosine islands (CpG islands) that are typically 
between 0.2 to about I kb in length and are located upstream of many housekeeping and 
tissue-specific genes, but may also extend into protein coding regions. Methylation of 
cytosine residues contained within CpG islands of certain genes has been inversely 
correlated with gene activity. This could lead to decreased gene expression by a variety 

25 of mechanisms including, for example, disruption oflocal chromatin structure, inbibiiioa 
of nranscnption factor-DNA bindtng, or by recruitment of proteins which interact 
^ectfiGaI1ywiihmeAy]atedsequeru:e& indirectly preventing U'anscripti^^ 
Some studies have demonstrated an inverse correlation between methylation of CpG 
islands and gene expression. Tisfiue-spectfic genes are usually umnethylated within the 

30 receptive target organ cells but are methylated in the germline and in non-expressmg 
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aduft tissues. CpQ islands of consmurtve^y-exprcssed housekeeping genes are normaUy 
iinmeiliylated in ihe gennline and in somatic tissuea. 

USS871917 discloses methods for detecting epigenetic abnonnalities comprising: 
S restriction of genomic DNA with a methylation-sensitive restriction enzyme (arestriciion 
enzyme that cleaves an unmethylated 8iie» but does not cleave the same site if it is 
methylated) that leaves an overhang; ligation of adaptors to the overhangs; PCR 
amplification with primers directed to the adaptors; followed by a subnactive 
hybridization to eliminatehotisekeepinggenes; and a second round of PCR amplification 
10 with a second set of primers directed to a second set of adaptors. A problem wiih this 
design is that the method is Ihnited to a testricnon enzyme that leaves overhangs and, 
further, the method is complicated due to the ligation of two sets of adaptors. 



WO99/0I S8D discloses mettiods for detection of senoraic imprinting disorders based on 
15 digestion of genomic DNA with methylaiion-scnsiiive restriction enzymes and PCR 
anq>lification using primers. One embodiment, directed to the detection of unmethyjared 
sequences, requires the use of a resiriciion enzyme that leaves overhangs and the use of 
exogenous adaptors, and therefore suffers ftom similar disadvantages as those described 
above in regards to 1JS5871917. Another embodiment, directed to the detection of 
20 methylated sequences, uses primers directed to endogenous elements such that exogenous 
adaptors are not required* but these primers are required to be positioned on either side 
of a methylation-sensiiive restnciion site. Since a methylation sensitive restriction 
enzyme will cut an unmethylated site, this nteihod can only he used to ampli& the 
methylated sequences, and caraun produce an unmethylated sequence which will be cut 
25 in between the two primers. 



30 
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li is an object of the preseitt invention to overcome disadvantages of the prior art. 

The diove object is met by a combination of the features of the main claims. The 
S sub claims cfisclose fimher advantageous embodiments of fiie invention. 

SUMMARY OF THE INVENTION 

10 The present invemion relates to detection of cpigenctic abnonnaUtids and 

diagnosis of non-Mendelian diseases associated with epigenetic abnotmalities, and 
identification and isolation of genes thai cause such diseases. 

Accoiding to the preseni invention there is provided a method of delecting an 
I S epigenetic abnormality associated wiih a non-Mendelian disease, the method comprising: 

a) extraction of genomic DNA fiom a sample that exhibits characteristics of a non- 
Mendelian disease; 

b) digestion of the genomic DNA ivilh a methylaiion-sensirive restriction enzyme to 
produce a pool of restricted DNA fragments; 

20 c) fractionation of the pool of resincted DNA firagments to obtain DNA fragments of a 
desired size; 

d) amplification of at leasi; a segmem of the DNA fragm^ts of a desired size with 
primers that anneal to an endogenous DNA element to produce a PGR product; 

e) clomng of the PGR product into a sequencmg vector, 

25 0 sequence determinarion of the PCR product to obtain a sequence of the PCR product; 
comparing flie sequence against a genomic database to assign a locus for the epigenetic 
abnormality associated with a non-Mendelian disease, 

A non-Mcndelian disease is any multi-factorial disease such as schizophrenia, bipolar 
disorder* cancer, and diabetes. 
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The sample ftom which DNA is cxiracied may he any cell, tissue, organ or oiher suiuble 
specimen that exhibits characteristics of a non~Mendelian disease. For example^ without 
wishing TO be limiting, in an Individual suffering ftom schizophrenia or bipolar disorder 
a sample may be obtained from brain tissue. 

5 

Any endogenous DNA element that is found to have epigeneric abnormalities aasodaied 
with a nonrMendelian disease canbe PCR amplified accoyduig to the present invention, 
bi a fimher aspect, Ote endogenous DNA element is a multi-copy DNA element. In a stUl 
fimber aspect, the multi-copy DNA elemem is selected from the groiq> consisting of 
10 UNE. SINE. LI, and Alu. 

In another aspect^ the present invention provides a method of identi&ing a gene having 
an epigenetically altered expression pattern that contributes to a non-Mendelian disease 
in an organism^ the method comprising: 
15 a) extraction of genomic DNA from a sample that exhibits characteristics of a non- 
Mendelian disease; 

b) digestion of the genomic DNA with a methylation-sensitive restriction enzyme to 
produce a pool of restricted DNA fragments; 

c) fractionation of the pool of restricted DNA fragments to obtain DNA fragmocits of a 
20 desired size; 

d) amplification of at least a segment of the DNA fragments of a desired size with 
primers thai anneal to an endogenous DNA element to produce a PCR product; 

e) cloning of the PCR product into a sequencing vector; 

0 sequence detemiination of the PCR product to obtain a sequence of the PCR product; 
25 g) comparing the sequence against a genomic database to assign a locus for said 

epigenetic abnomtality associated with a non-Mendelian disease; 

h) searching said database to identifir a gene located proximal to said locas; 

h) comparing expression patterns of said gene located proximal to said locus within a test 

sample that exhibits characteristics of said non-Mendelian disease with expression 
30 patterns of a corresponding gene within a control sample to identify said gene having an 

epigenetically altered expression pattern. 
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Genes can be idemified in accordance with the preseni invention from any eukaryotic 
organism including, plants and animals, where epigeneticabnormahv 
the occiimnce of non-Mendelian disease. 

In yei another aspect, the present invemicm provides » method of isolating a probe for 
detecting an epigenetic abnonnality associated with a non-Mcndetian disease in an 
animal, said method conqirising: 

a) extraction of genomic DNA from a sample that exhibits characteristics of a non- 
Mendelian disease; 

b) digesticm of said genomic DNA with a meihylaiion^scnwtivc restriction enzyme to 
produce a pool of restricted DNA fragments; 

c) fractionation of said pool of restricted DNA fragments to obtain DNA fragments of a 
desired size; 

d) amplification of at least a segment of said DNA fragments of a desired size with 
primers that anneal to an endogenous DNA elemem to produce a PCR product; 
0 using said PCR product as said probe to deteci said epigenetic abnoimaltty associated 
with a non-Mendelian disease in another sample. 

20 TWs summary doesnotnecessanly describe all necessary features of the invention 

but that the invention may also reside in a sub-combination of the described features. 
BRIEF DESCRIPTION OP THE DRAWINGS 

These and other features of the invention will become more apparent from the 
25 following description in which reference is made to the appended drawings wherein: 

FIGliRE 1 shows the localization of the cloned Ahi elements. 

FIGURE 2 shows coding genes that are located in the vicinity (writbin 100,000 bp) of 
30 cloned Alu elements. 

FIGURE 3 shows sequences of the cloned Alu elements. 
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DESCWFTION OF PREFERBED EMBODIMENT 

The iDventioa relMS to methods and compostrions for sdenrificarion of 
epigenedc abnonnaliiies. More fivticularly. the present invention relates to diagnosis of 
diseases based on DNA xnetbyiadon differences and identification of genes thai canse 
such diseases. The present invention provides methods and compositiMs for detecting 
and isolating DNA sequences which aie abnormally or differentially methylated in a 
diseased cell t^e when compared to a normal cell type. 



Traditicmal linkage snidies in complex diseases such as schizophrenia, bipolar disorder, 
cancers and diabetes have only succeeded in isolating chromosome regions, often 
containing 200-300 genes. The ability to screen such a large number of genes is clearly 
a nme-consuming and daunting task. The present invention provides a shon-cut in 

15 deteimining which genes within a 200-300 gene region are in fact Tespo«>sible for the 
onset of a major disease such as diabetes, schizophrenia^ cancets. or bipolar disorder. 
According!© the present mveniiondiffeientiallymodificd, endogenous mute^ 
elements can act as maA&s for genes which are dys-regulated. Epigenetic analysis of 
so called "jotflc" DNA leads to a *shon-cui* in identification of specific genes, dys- 

20 regulation of which increases the risk to mSQor disease. 

The following description is of a prefiened embodiment by way of example only 
and without limitation to the combination of feamres necessaiy for carrying rtie invention 
into effect. 

25 

The methylation paiiems of DNA from nunor cells are generally difTcrem than those of 
nomtal cells (Laird et al. PNA Methylation and Cancer* 3 HUMAN 
MOLECULAR GENETICS 1487, 1488 (1994)). Tumor 

cell DNA is generally undenncmylaied relative to normal cell DNA. but selected regions 
30 of the turooT cell genome may be more highly methylated than the same regions of a 
normal cell's genome. Hence, detection of aliared methylation patterns in the DNA of a 
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tissue sample is an indication ihai the tisnie Is cancerous. For example, ihe gene for 
InTOlin-UkcQrowthFaciot2(lGF2)ishypomHhylatedmanumberofcancetoufiti^ 

such as Wilm's Tumors, riwbdomyosarcoma, lung cancer and 
hepatoblastomas (Rahmer ei a!. 362 NATURE 747^9 (1993); Qgawa, et al.. 362 
5 NATURE 749-51 (1993); S. Zhan et al.. 94 J. CUN. INVEST. 445^8 (1994); P. V. 
Pedone et al. 3 HUM. MOL. GENET. 1 117-21 (1994): H. Suzuki ei al.. 7 
NATURE GENET432-38(1994);S. Rainier et at. 55 CANCERRES. 1836-38(1995)). 

AUeraiion of meibylation may be a key. and common event, in the development of 
10 neoplasia and may play at least two roles in tumoiigenesis: 

1) DNA hypomethylaiion may cause an increase in proto-oncogene expression or DNA 
l^imediylation may decrease expression of a tumor supressor which contributes to 
neoplastic growth; and 

2) DNA hypomeihylation may change chromatin sirucmre, and induce abnonnalities in 
15 chromosomcpairingand disjunction. Such smiciuralabnonnalltiea may result ingenomic 

lesions, such as chromosome deletions, amplifications, inversions, mutations, and 
translocations, all of which are fbund in human geneiic dbeases and cancer. 

While the pi«sent invention can be used for detecting aiv altwation in 
20 meihylation. the piesem invention is parricularlyuseftil for detecting and isolating DNA 
fiagmems that are noimally methylated but wWch. for some leaeotu are non-methylated 
inapmpoition of cells. Such DNA ftagmems may iwrmally be methylated foranumber 
of reascms. For example, such DNA fiagmenis maybcnoimally methylated because they 
contain, or aie associated with, genes that are rarely expressed, genes that are expressed 
25 on^ during early development, genes that are expressed in only certain cell-types, and 
the like. 

AS used herein, hypomeihylation means that at least one cytosine in a CG or CNG di- ot 
tri-nucleoUde site in genomic DNA of a ^ven cell-type does not contain CH3 at the fifth 
30 positionof thecytosinebascCelltypeswhichroayhavehyporoethylated COsorCCGs 

mclude any cell type which may be expressing a non-hottsekeeping flmciion. This 
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includes both mmnal cells thai expiess tissue-specific or cell-type specific fieneric 
fimctions. as well as tumoions, cancerous, and similar cell ij^ies. Cancerous ceU types 
and conditions which can be analyicd. diagnosed or used lo obtaining probes by the 
piesent methods include, but are not limited to. Wibn's cancer, breast cancer, ovarian 
5 cancer, colon cancer, Wdney cell cancer, liver cell cancer, lung cancer, leakonia, 
thabdomypsarccHna, sarcoma. «ad hq»toblastoma. 

A method of the present invention is directed to detection of epigenetic 
abnormalities associated with a non-Mendclian disease and comprises extraction of 

10 gcnomicDNAftomanon-MendcHandi3caac8aniple,auchasdisea8cdtisaHeordiseased 
population of cells; hydrolysis of this DNA with methylation-sensitive restriction 
enzymes, and subsequent fiactionation of DNA fiagroents and purification of DNA 
ftagmenis of a desired size, for example, but not Umtted to. shoito^ than 10 KB. These 
purified DNA flagmems are funher subjected to PCR amplification usrog primers that 

1 5 hybridize to endogenous multi-copy DNA elements inchidiitg. but not limited to, ALU 
or LI elemoits. After dui^ PCR products of such elements are cloned and sequenced 
using standani molecuUif biology techniques known to die sWUed anisan and the 
reliant sequences are m^ped on the genome using any commercially or publicly 
availia>le human gsnome database. These cloned multi-copy elements indicate a loci of 

20 putative epigenetic abnoimalhy or epigenetic dys-regulation and indicates genes that 
predispose a panem to a complex. non-Mendelian, multi-factorial disease, such as, but 
not limited to, cancers, diabetes, schizophrenia, or bipolar disorder. 

By flie term •^on-Mendelian disease" is meant any disease which etiologically requires 
25 more than a single genetic abnonnaliiy. As such a non-Mendelian disease requiies more 
than one factor, or in other words, is multi-factorial, and may comprise ej^genetic 
alterations or abnoraialities. 

Epigenetics relates to higher order gene control mechanisms in eukatyotes that activate 
30 or repress parts of the genome via changes in chromatin suucture. These hitfusr order 
gene control mechanisms fonn an Important molecular basis of cell differentiation. Any 
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changes in an organism brought about by alterations in ttie action of graes, where the 
changes do not require occurrence of any mutations, are called epigenettc changes. An 
epigenetic abnoimality occurs when an epigenetic change contributes or predisposes 
normal cells into becoming diseased cells. DNA methylation is an example of an 
qiigenetic mechanism. The tcnn DNA methylation refers to the addition of a methyl 
group to the cyclic carbon S of a cytosine nucleotide. A family of conserved DNA 
methyltransferases catalyzes this reaction. Normally. DNA methylation can be used, for 
example, but is not limited to, to methylate the transcription unit of a gene so thai the 
gene is turned off or silenced, and a corresponding protein product is not produced in a 
particular cell. For instance, one of the two X chromosomes in female mammals is 
inactivated or silenced by methylation. 

DNA is eximcted fiom a non-Mendelian disease sample uting standard techniques, 
known in the an. for isolating DNA from various samples such as cells , tissues, or 
oigans, or other suitable specimens. Standard techniques for isolating DNA have are 
disclosed in reference textbooks or manuals such as Sambrook, Frilsch> and Maniatb, 
Molecular Cloning: A Laboratory Manual (1989), Cold Spring Haibor. 

DNA may be extracted from any sample that may have epigenetic abnoimalities 
associated with a non-Mendelian disease or any sample diat exhibits characteristics of a 
non-Mendelian disease, for example, but not limited to cells of the following tissues: 
Epithelial Tissues, Exocrine Glands, Endocrine Glands, Connective Tissues, Adipose 
Tissue, Cartilage, Bone, Blood, Muscle Tissues comprising Smooth, Skeletal or Cardiac 
Muscle Tissue, or Nervous Tissue comprising Brain Tissue. 

Any methylaiion-sensiiive restriction enzyme may be used for the purposes of this 
invemion. The terms "restriction endonucleases** and "restriction enzymes" refer to 
bacterial enzymes, each of which cut double-stranded DNA at or near a specific 
nucleotide sequence. The process of cutting or cleaving the DNA is referred to as 
resuiction digestion. The products of a restriction digestion are referred to as restriction 
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piodncts. A restriction enzyme used in llie present invention may yield restriction 
pioducts having bluni-cnds or oveitianglng "aicky" ends. Specifically, a restriction 
en^e can symmetrically cut both strands of a double stranded UNA fragment to 
produce a blimt-^ed fragment, or a resnictiQix wtzyme may assymetrically cleave the 
S two strands of a DNA fra^em to produce a DN A fragment that has a single stranded 
ovohang. In general, a methylation-sensuive restriction enzyme used in the present 
invenHon wiH recognize and cleave a non-mcUiylated sequence, while it will not cleave 
acotresponding methylated sequenfce. Mediylation of plam and mammalian DNAoceuis 
at CG or CNQ sequences. This meihylation may interfere with the cleavage by some 
10 restriction endonuoleases. Endonucleases that are sensitive and not sensitive to m*CG or 
m^CNG methylation, as well as isoschizoraers of methylalion-sensiUve restricdtm 
endonucleases that recognize identical sequences but cBffer in their sensiavity to 
methylation. can be extremely useful for studying dte level and distribution of 
methylauon in eukatyotic DKA. Examples of methylaiion-sensitive testrictionenzymes, 
15 and corresponding restriction site sequences, that can be used accoiding to the present 
inventloninclttde,botarenoi limited to: Aatn(GACGTC);Bshl236I(CGCG);Bshl285l 

(CGRYCG); BshTI (ACCGGT); Bsp6n(TCGCGA); Bspll9I fTTCQAA); BspMBU 
(RCCGCY); BsulSI (ATCGAT); Cfrlffl (RCCGGY); Cfi42I (CCGCGG); Cpol 
(CGOWCCG); Eco47ffl(AGCGCT);Eco52I(CGGCCG);Eco72l(CACGTG): EcolOSI 
20 (TACGTA); Ehel iGGCGCC): Esp31 (CGTCTC); FspAI (RTQCGCAY); Hinll 
(GRCGyC);Hin6l(GCQC);Hpan(CCGG);Kpn2I(TCCOGA).Mim(ACGCGT);NotI 

(GCGGCCGC); Nsbl (TGCGCA); Paul (GCGCGC): Pdil (GCCGGC); Pfl23U 
(CGTACG): Pspl406I (AACGTT); Pvul (CGATCQ); SaU (GTCGAC); Sroal 
(CCCGGG): Smul (CCCGC); Tatt (ACGT); or Taul (GCSGC). 

25 

Size fractionation and purification of reslrictedDNA fragments can beperformedby any 
method known in the art, for example, but not Umited to, separation of DNA fragments 
of a desired size such as fragments of less dum 10 kB by centrifiigalion of a DNA 
fragment pool through a membrane or oflier suitable matrix having size exclusion or 
30 inchisionproperties. Alternatively, apoolof restricted DNA fragments maybe separated 
using agaroseof polyacrylamide gel electrophoresis and DNAfiragmenisof adesired size 
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may be purified any suitable gel-extraciion composition such as ^ass milk or 
Quatenuoy ammonium ions. The desired dze limit of the fiactionated and isolated DN A 
fragments depends on the size of die endogenous DNA element that saves as a template 
for PCR amplification. As such the **ONA fragments of a desired size" can be any size 
5 as long as they arc larger than, and can ther^re comprise the endogenous DNA element. 

As used, the tenns "amplification/* "ampli^," or "amplifying/' are defined as the 
production of additional copies of a nucleic acid sequence and is generally carried om 
using polymerase chain reaction (PCR) or other technologies well known in the art (e.g., 

10 Dieffenbach andDvckslcr, PCR Primer, a Laboratory Manual, Cold SpringHarbor Press, 
Plainview NY [1995]). Nucleic acid amplification techniques allow fi>r increasing the 
concentration of a target or template sequence, or a portion or segment thereof Ijom a 
mixnire of genomic DNA withom cloning or purification. A review of current nucleic 
acid anqilification technology can be found in Kwoh et al, 8 Am. BiotechnoL Lab. 14 

15 (1990). In vitro nucleic acid amplification techniques include polymerase chainieaction 
(PCR), transcription-based amplification system (TAS). self^ustMued sequence 
replication system (3SR), ligation anqslificauon reaction (LAR). ligase-based 
amplification system (LAS), Q.bcta. ttN A replication system and run-off transcription. 
All present and fiiiure nucleic acid amplification technology can be incorporated into the 

20 present invention. 

PCR is a preferred method for DNA amplification. PCR synthesis of DNA fragments 
occurs by repeated cycles of heat denaturaiion of DNA fragments, primer annealing onto 
endogenous sequence elements or exogenous adaptor ends of a DNA fragment or other 

25 suitable DNA template* and primer extension. These cycles can be perfonned manually 
or, preferably, automatically Thennal cyclers such as the Peridn-Elmer Cctus cycler are 
specifically designed for automating the PCR process, and are prefeired. The number of 
cycles per round of synthesis can be varied from 2 to more than 50, and is readi^ 
determined by considering the source and amount of the micleic 

30 acid template, the desired yield and the procedure fi>r detection of the ^diesized DNA 
fragment. 
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PCR techniques and many variations of PCR are known. Basic PCR techniques are 
described by Saiki et al. (1988 Science 239:487-491) and by K.B. MuUis in U^. Pat. 
Nos. 4,683,195^ 4,683,202 and 4,800,159, wMch ate incorporated herein by reference. 



5 The conditions generally required fbr PCR indude tenqieratuie, salt, cation, pH and 
telated conditions needed for efficient amplification of at least a segment or portion of 
a DNA fragmem t^plate. PCR conditions include repeated cycles of heat denaturatioii, 
and incubation at a temperature pemiitting primer hybridization to an endogenous 
sequence elements or exogenously llgated ad^tors> and copying of the DNA fiagimem 

10 by the amplification enzyme. Heat stable amplification enzymes like the pwo, Thennus 
aquaticus orTheimococcus liloralis DNA polymerases are coitmiercially available which 
elitninate the need to add enzyme after each denaturation cycle. The salt, cation, pH and 
related factors needed for enzymatic amphfication activity are available fifomcommercial 
manufacturers of amplification enzymes. 

15 

As provided herein an amplification enzyme is any enzyme which can be used for in vitro 
nucleic add aipplification, e.g. by the above-described procedures. Amplification 
enzymesmaybetheimostableorthermolabile. Sucharoplificationenzymesinchidepwo*^ 
Estdienchia coli ON A polymerase 1. Klenow firagment of E. coli DNA polymerase 1, T4 

20 DNA polymerase. T7 DNA polymerase, Thermus aquaticus (Taq) DNA polymerase. 
Thermococctts litoralis DNA po^nnerase, SP6 RNA polymerase, T7 RNA polymerase. 
T3 RNA polymerase, T4 polynucleotide kinase. Avian Myeloblastosis Virus reverse 
transcriptase, Moloney Murine Leukemia Virus reverse transcriptase, T4 DNA ligase, E. 
coli DNA ligase. Vent polymerases, or Q.beia. replicase. Prefeired amplificanon enzymes 

25 are the pwo and Taq polymerases. The pwo enzyme is especially preferred because of its 
fidelity in replicating DNA. 

With PCR, it is possible to amplify a single copy of a specific target sequence in genomic 
DNA to a level detectable by several different mediodologies (e.g., hybridization with a 
30 labeled probe; incorporation of bioimylated pnmers followed by avidin-enzyme 
conjugate detection; incorporation of 32P-labeled deoxynucleotide triphosphates, such 
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as 4CTP or dATP, into the amplified seenient). In addition to genomic DNA. any 
oligonucleotide sequence can be amplified with the appropriate set of primer molecules. 
In particular, the amplified segmaats created by die PCR process itself are, themselves, 
efficiem templates for subsequent PCR amplifications. 

5 

By *e term "primer" is meant an oligonucleotide, whether occurring naturally as in a 
purified restriction digest or produced synthetically, capable of acting as a point of 
initiation of synthesis when placed under suitable conditions in which synthesis of a 
primer extension product that is complementary to a nucleic acid strand is induced. Such 

10 suitable conditions comprise nucleotides and an amplification enzyme such as DMA 
polymerase and a suitable temperamre, salt concentrarion, and pH). The prhner is 
preferably smgle stranded formaxiraum efficiency in ampUficatian,batmayaltematively 
be double stranded. If double stranded, the primer is first nreated to separate its strands 
before being used to prepare extension products. The primermusi be sufficiemly longto 

1 5 prime Ae synthesis of extmsion products in the presence of the inducing agent. The exact 
lengths of the primers will depend on many factors, including temperature, salt 
concentration , pH, source of primer and the use of the method. The primers of the 
present invention can hybridize or anneal to a sequence element diat is endogenous to a 
DNA fragment template or the primers can anneal to exogenous adaptor sequence 

20 elements fltat have been ligared to the ends of a DNA fragment template. Preferably, the 
primers anneal to an endogenous multi-copy DNA sequence element, for example, long 
or shon interspersed nucleotide elements (LINEs or SINEs).. 

Endogenous mults-copy DNA sequence elements are repetitive DNA sequences that 
25 together are estimated to comprise 30% of total genomic sequences. Present at between 
10 - 105 copies per genome these multi-copy elements can be (bund throughout the 
euchromatin and have been categorized as: 

a) nucrosatelliies / minisatelUtes (VNTR. DNA 'fingerprints) 

b) dispersed-repetitive DNA, mainly transposable elements (UNES/ SINES) 
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Also includes *vedundanr genes for hifiioncs, and ribosomai UNA and proteins, 
(gene-products jmssent in cell in large numbers). 

Many multi-copy DMA elements may be involved in regulation of gene ^pression as 
diey have been shown to be mterspersed within single-copy sequences and have been 
shown to be located adjacent strucmral genes. 



Long and short interspersed nucleotide elements (UNEs and SJNEs). are represented in 
humans mainly by LI (Furano AV. The biological propenics and evolutionary dynamics 

10 of manimahan LINE-1 rctrDiransposons. Prog Nucleic Acid Res Mol Biol. 
2000;64:255-94) and Alu elements (Watson et al., MolecularBiology of the Gene, four* 
edition (1987) pp. 669-670), respectively. Both types of elements arc consdered to be 
retroiransposable (ie, can replicate via an RNA copy reinserted as DNA by reverse 
transcription) and they have significant roles in g^mic function. The insoted elements 

15 can be full length or truncated, or n»y be reaiianged relative lo fbll-lengdt elements. 

The most common and best characterised LINE is LI . having the following properties: 
Repeated approximately SOOOO times in the human genome (0 5% of total) 
Only about 3000 of diese are full length; the remainder are truncated, mostly at 

20 the 5' end. 

• Full length element is about 6kb in size and contains two open reading frames, 
one of winch encodes a reverse transcriptase. 

• AT-rich region is located near the 3' end of the element, 

• Element is flanked by two short direct repeats. 

25 

The main type of SINE is the Alu family, characterized as follows: 

• usually contain a target for the restriction enzyme Alu I; 

5 X 10* - 10* copies in the haploid genome, with an average of one repeat eveiy 
4 to 5 kb (I - 10% total); 
30 - Often present in the transcription unit of a gene, within inirons and occasionally 
in non-translated regions of the mBNA; 
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Generally eontain 300bp consensus sequence wluch consist of two tandem 
repeats of a 130bp sequence, one of v/hich has a 32bp deletion, as such Ala 
ftmtly members are recognizably related in sequence, bui noi precisely 
conserved: 

S • Elements are flanked by direct repeats; 

• Each repeat unit has an AT-iich region that suggests a poly A tail; 
5' end resembles a pol HI promoter region. 



LINES and SINEs both have a poly(A) tail which may act as a template for reverse 
10 transcription from nicks made at the site of insertion in the host DMA by a UNE-encoded 
endonuclease. 



Primers of the present mvention may be designed according to any LI or Ato sequence. 
For example, various analyses (Claverie,JJ4- andMakalowski,W, Aln alen,Naturc371, 
15 752 (1994)) indicate that Alu repeats fail into 8 $ubfamihes» and therefore, 8 ALU 
consensus sequences have been constituted and added to GehBank as accession numbers 
U14S67,U14568.U14569.U14570,U14571,U14572,U14573 andUl4574. Apiimer 
of the present invention may be designed in accordance with any of these consensus 
sequences. For example, the dqiositcd consensus sequence ofa subfamily of Alu repeats 

20 designated U14S70 is as follows: 

GGCCGGQCGCQOTGGCTCACGCCTGTAATCCCAGCACTTTGGQAGGCCGA 

GGCGGGTGGATCATGAGGTCAGGAGATCQAGACCATCCTGGCTAACAAGQ 

TGAAACCCCGTCTCTACTAAAAATACAAAAAATTAGCCGGGCGCGGTG 

25 Products of amplification reactions can be subjected to sequence determinations. 
Amplification products, preferably PCR products, can optionally be cloned into a vector 
before sequencing. When not cloning a PCR product, an adaptor DNA elements can be 
ligated to the ends of PCR products, and the PCR products can be sequenced using a 
primer that anneals to the adaptor clement. Cloning, ligation, and sequencing can be 

30 perfoimed using standard techniques , such as protocols described in textbooks or 
manuals such as Sambrook. Fritsch and Maniatis, Molecular Cloning: A Laboratory 
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Manml. 1989. a1so» ccwmiercially available Wis maybe utUized-Anoi^ 
sequence detenninattOA are automated DNA sequendng systems and methods. 



5 Nucleic acid sequences of amplification products isolated according to methods of 
piesent are disclosed in Figure 3. The region of the chromosome to which a given 
sequence is located may be detenntned by hybridization, including, but not limited to 
PGR amplification methods, or by database searching. 

1 0 Hybridization methods and conditions arc well known in the art- Nucleic acids that are 
identical to the provided nucleic acid sequences, bind to the provided nucleic acid 
sequences (disclosed in Figure 3) under stringent hybiidization conditions. By using 
pn>bes, particularly labeled probes of DNA sequences, one can determine a region of 
chromosome where a given sequence is located and thereby establish chromosomal loci 

IS for epigeneiic abnormalities associated with a non-Mendelian disease. 

Preferably, hybridisation is p^ormed using at least 1 5 contiguous nucleotides from any 
sequence identifiedby the methods of the present invention including, but not limited to. 
sequences disclosed in Figure 3, Tlie probe will preferentially hybridize with a nucleic 

20 acid comprising a complementaxy sequence to the probe, allowing the 

idoitification of the chromosomal region of the nucleic acids of the biological material 
that uniquely hybridize to the selected probe. Probes of more than 1 5 nucleotides can be 
used. e.g. probes of from about 18 nucleotides up to the entire length of the provided 
nucleic acid sequences, but 15 nucleotides generally represents sufficient sequence for 

25 unique identification. 

As mentioned above once the sequence (or a portion of the sequence) of a multi-copy 
DNA element has been isolated, this sequence can be used to map the location of the 
multi-copy DNA element on a chromosome. Accordingly, nucleic acids of the invention 
30 described herein or fiagments thereof, can be used to map the location of multi-copy 
DNA elements of the invention on a chromosome. The mq^ping of the sequences of 
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nueleic acids of die invention to chromosomes is an impoxtam first step in correlating 
diese sequences with genes associated with disease. 

Briefly, sequences of die invention, for example, sequences disclosed in Figure 3, can he 
mapped^chtomosomesbyprqiaringPCRprimers (preferably 15-25 bp in length) finom 

the sequences of nucleic acids of the invention. These primers can then be used for PCR 
scrccningofsoraaticcell hyhridscomaining individual human chromosomes. Onlyibose 
hybrids containing the human sequence corresponding to the sequences of imcleic acids 
of the invention will yield an amplified fi^gment 



Somatic cell hybrids are prepared by fusing somatic cells from different mammals (e.g.» 
human and mouse cells). As hybrids of human and mouse cells grow and divide, they 
gradual ly lose human chromosomes m random order, but retain the mouse chromosomes. 
By using media in which mouse cells cannot grow (because they lack a particular 

15 enzyme), but in which human cells can, the one human chromosome diat contains the 
gene encoding a needed enzyme, depending on the media, wtU be retained. By using 
various media, panels of hybrid cell lines can be established. Each cell line in a panel 
contains rither a single human chromosome or a small number of human chromosomes, 
and a foil set of mouse chiomosomes, allowing easy mapping of individual sequences to 

20 specifichuman chromosomes. (D'Eustachio et al. (1983) Science 220:919-924). Somatic 
cell hybrids containing only fiagments of human chromosomes can also be produced by 
using htunan chromosomes with translocations and deletions. 

PCR mappmg of somatic cell hybrids is a rapid procedure for assigning a particular 
25 sequence to a particular chromosome. Three or more sequences can be assigned per day 
usmg a single thermal cycler. Using the sequences of nucleic acids of the invention lo 
design oligonucleotideprimers. sublocalizaiion canbe achicvedwith panels of fiagments 
fiom specific chioinosomes. Oiher mapping strategies which can similarly be used to 
map a sequence of a nucleic acid of the invention to its 
30 chromosome include in situ hybridization (described in Fan et al. (1990) Proc. Nail. 
Acad. Sci. USA 87:6223-27). pre-screening with labeled flow-sorted chromosomes. 
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pre^leciionbyhybridizationtodiroinDBOine specific cDNA Kbraries, andfiearcWngof 
genomic databases. 

Once Ihe sequence (or a pomcm of ihe sequence) of a multi-copy DMA dement has been 
5 isolated, this sequence can be used to map the location of the gene on a chromosome by 
eearchingagenomic database, for e>taniple,buinoi limited to,ahiiman genome da 

(Www.Benome.uc8c.edu/). Several genome databases are also available from Cetera 
Corp. or rtieNational Center for Biotechnology Information (NCBI). Genome databases 

can be searched by comparing the known quciy sequence or reference sequence with 
10 genomic sequences aiored and annouted in a database, and selectinB sequences from d» 
database that have a high similarity, preferably greater than 80% similariiy. with the 
queiy or reference sequence. Sequence srorilaniy is calcnhlted based on a lefocncc 
sequence, which may be a subset of a largpr sequence, such as a conserved motif, coding 
region, ttanking region, etc. A reference sequence wi« usnaUy be at least about 18 
IS contiguous nucleotides long, more usually at least about 30 nucleotides long, and may 
extend to the complete sequence that is being compared. Algorithms for sequence 
analysis are known in the ait, such as BLAST, described in Altschtfl et aL, J. Mol. Biol. 
(l990)21S:4d3-lO. 

20 To deterojine whether a nucleic acid exhibits similarity with the sequences presented 
herein, oligonucleotide alignment algorithms may be used, for example, but not limited 
to a BLAST (QenBank URL: www.ncbi.nlm.nih.gov/cgt-bin/BLAST/, using default 
parameters: Program: blasm; Database: nr; Expect 10; filter, defeult; AUgnment; 
pairwise: Query generic Codes: Siandard(l)).BLAST2(EMBLURt: http://www.enibl- 

. 25 heidelberg.de/Scrvices/index.htrolusingdcfaultpaiameters:MatrixBLOSUM62:Filter. 
deault, echofilten on. Expcci:JO, cutoff: default; Strand: boflt; Descriptions: SO. 
AUgnmenis: 50), or FASTA, search, using default parameters. 

Fluorescence in situ hybridization (FISH) of a DNA sequence to a roetaphase 
30 chromosomal spread can further be used to provide a precise chromosomal location in 
! step. Chromosome spreads canbemade usingcelU whose divisionhasbecnblocked 
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in m^hase by a chemical, eg., colcemid ihar disnq>is the miioTic spindle. The 
cteoniosoiiMSBcanbe treated brieflywihttypsiiu and ihensminedwiihGiemsa^ 
of light and da* bands develops on eadi chromosome, so that the chromosomes can be 
idajtified individually. The FISH vchnique can be used with a DNA sequence as short 
S as 500 or dOO bases. However, clones larger than 1 ,000 bases have a higher likelihood 
of binding to a unique chromosomal location with sufficient signal intensity for simple 
detection. Preferably 1,000 bases, and more prcfwably 2.000 bases will suffice to get 
good results at a reasonable amount of time. Fwr a review of this technique, see Vwma 
et al., (Human Chromosomes: A Manual of 
10 BaaicTochmques(PersamonPr«ss,NewYorlc I08g)). Sequences ofisoteled multi-copy 
DNA elements of the present invention that are shorter than 500 bases can be extended 
byanysuitable technique, for example, aknown sequencecanbe extended by atechnique 
of genomic sequencing using a primer designed according to the known sequence. 

15 Reagents fbr chnmiosome mapping can be used individually to mark a single 
chiomosome or a angle site on that duomosome. or panels of reagents can be used fat 
marking multiple »tes and/or multiple chromosomes. Reagents conesponding to 
noncoding re^ns of the genes actually are preferred for mapping purposes. Coding 
sequences are more likely to be conserved vrifliin gene {amilies. thus increasing the 

20 chance of cmss h^uidizaticHis during chromosomal maj^nng. 

Once a sequence has been mapped to a precise chromosomal location, the physical 
position of the sequence on the chromosome can be correlated with genetic map data. 
(Such data are found, for example, in V. McKusick, Mendelian Inheritance in Man. 
25 available on-line through Johns Hopkins University Welch Medical Library). The 
relationship between genes and disease, mapped to ihe same chnmiosomal region, can 
then be identified through linkage analysis (co-mheritancc of physically adiacem genes), 
described in, e.g.. Egeland et al. (1987) Nature 325: 783-787. 



30 



Probes specific to the nucldc adds of the mvention can be generated using a whole or 
poitton of the nucleic acid sequences disckised in Figure 3. The probes can be 
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synthesized chanicany or can be generated fiwn longer nucleic acids using lestticiion 
enzymes. The probes can be labeled, for example, with a radioactive, bioiinylaied. or 
fliunescent tag. Piefciably, probes are designed based upon an idenii^g sequence of 
a nucleic acid of one of Figure 3. More preferably, probes are designed based on a 
contiguous sequence ofoneofUiesulgectmicldc acids that remain uninasked following 

apptication of a masking program for nuudcing low complexity (e.g., XBLAST) to the 
sequence.. i.e. one would select an unmasked region, as indicated by the nucleic acids 
outside tbe poly-n stretches of the masked sequence produced by the masking program. 
Probes are not only useful for deiennining chromosomal locationofasequence. but also 

canbeused to determine whether an cpiginiciicabnonnaUty exists in amothcr sample, for 
example a test sample obtained from a cukaryotic organism that exhibits symptoms of 



a non-Mendelian disease. 



Once a chromosomal locus has been assigned ID a multi-copy DNA element obtained by 
IS thepreseni invention, a genomicdatabaseor geneticmapdatacanbeusedioidentifyone 
or more genes thai are proximal to the assigned chromosomal locus, preferably the 
identified one or more g^es are physicaUy adjacew to the assigned locus. Expression 
patterns of the genes In a non-Mendelian disease sample can then be compared agwnst 
the expression pattern of correspondmg genes in a control sample to identify a gene 
20 havinganepigcneticallyaltcred expression panem. The non-Mendelian disease sample 
and th«5 control sample can be obtained ftom within the same organism, for example, 
without wishing tobe limiting, expressionofagene within cancerous kidney cells could 

be conQiared against expression of a conresponding gene in a non-cancerous kidney cell 
of the same organism. Alieroately. the non-Mendelian disease sample and the control 
25 sample can be obtained ftom different organisms. For example, without wishing to be 
limiting, expression of a gene in a prefrontal cortex sample from a schizophrenic 
individual can be compared against expression of a conresponding gene in a preftontal 
cortex sample from a different non-schizopbrenic individual. 

30 TechniquesfordeterminingexpressionpatieinsofgenesareweUknownintheart. For 
example, gene expression patterns eanbe establifdied usmgNorthero analysis, reporter 
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constructs such as 0FP» quantitative PCR amplification, or DNa chip amlysis. If. for 
example, gene expression within a sample is determined using DNA chips, the mRNA 
firom the sample is extracted, reverse transcrihed to the corre^onding cDN A» amplified, 
fluorescently labeled and allowed to hybridize with the sequences on a chip. 
S Sequence-specific labels are captured on the fitirface of ttie chip. By reading the 
Oiiorescence» one can determine which of the genes were expressed and at what levels. 
DMA chip analysis is provided by several companies, for exanqile. but not limited to. 
Af^meirix and Naitogen. DNA chip technology is an eifeciive method for detennining 
expression patterns of genes and semiconductor fabrication technology has allowed for 
10 the packing of ^usands of gene sequences Uno square centimeter surfaces. Use of 
reponer construct^ Northern analysts, and quantitative PCR amplification are equally 
effective ahematives. 

IS Potcnrial therapeutic approaches . 

Detection of epigenetic abnonnalities associated with non-Mendehan diseases including, 
but not limited (o schizophrenia, diabetes, cancers and bipolar disorder may le^d to 
iimovative DNA modification-based iherapies. Recently a compound protein consisting 
of a DNA methy lation enzyme and a zinc-finger protein was constructed (Xu G-L, fiestor 

20 m Nanure Genetics 17: 376-379. 1997). The mechanism of action of the protein 
consists of Ae recognition of a specific DNA sequence by tiie xinc-finger protein that is 
^ecific for that sequence and subsequent modification of the surrotmding cytosines by 
ONAmodificauon enzymes. Aspecific protein withDNAmodtfication enzyme lestoring 
flte Aomial pattern of DNA methylation can be generated. The blood-brain hairier has 

25 been a major obstacle for the bloodbome genetic constructs to reach the brain, but a 
recent study demonstrated that pegylated neutral liposomes, unlike caiionic ones^ are 
stable in blood, do not get entr^iped in the limg, and are able to efficiently deliver 
plasmid DNA through the blood brain barrier to the various sections of brain tissue . 



30 



The present invention provider methods and compositions for detecting DNA elements 
that act as a marker for the specific dysfonctional genes and ai the same time identify the 
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q)ecific genes involved in a aon-Mendelian diseases. Sueh infiamaiion would lead 
quickly lo «be developmeni of a du^sdc test for such diseases, thai could be 
incoipoiaied inio a diagnostic Wt Further leseaich on specific genes may also lead to 
mattment options forpeoplefiufferingfiomanon.Mcndeli»diseafie through enherge^^ 

5 ther^ywoilc or tiuough targeted drug developmem. 

The heuristic value of epigenerics in non-Mendelian diseases, including scluiophrenia, 
derives from numerous important characteristics of epigeneric regulation of ^es 
(Petronis A. Human morbid genetics revisited: relevance of epi^enes. Trends Genei. 
10 2001 Mar,l7(3):l42-6).Thcepigeivetieresewdiprogram»ndicaiesfl»tn:gulationofgcne 
activity is critically important for noixnal fimctioiung of the genome. Genes, even the 
ones that cany no mutations or disease predisposing polymorphisms, may be useless ta 
even haimfid if not expressed in the appnqniate amount, at the right time of the cell 
cycle, or in the right compartment of the nucleus. Epigeneiic mechanisms, more so tiian 
15 DNA sequouie^iased ernes, can exphnn a scries of phenomenolo^cal features of a non- 
Mendelian disease, for example, in the case of, ras«orpsychosis mcluding: i) relatively 
late agfe of onset and coincidence of the first symptoms with changes in the hormonal 
status in tiie organism; H) sexual dimorphism-, iii) fluctuating course and sometimes 
recovery; Iv) parental origin effects; and v) discordance of M2 twins. We also 
20 re-analyzed several etiological theories of major psychosis from an epigenetic point of 
view (Pelronis A, Paierson AD, Kennedy JL. Schizophrenia: an epigeneric puzzle? 
Schizophrenia Bulletin 25:4: 639-655, l999;Petronis A. The genes formajor psychosis: 
abeirant sequence or regulation? Neuropsychophaimacology, 23(l):t-l2; 2000) and 
suggested that epigenetic mechanisms have the potential to explain a number of clinical 
25 and molecular findings tiiat traditionally have been supporting unrelated and somewhat 
antagonistic theories of schizophrenia and bipolar disorder, or have not been explained 
at all. With regards to the field of ncurobehavioral disorders the heuristic value of die 
epigenetic model of major psychosis lies in the possibility of integrating a wide variety 
of empirical data into a new ti»eor«tlcal fiamewoik, which provides die basis for new 
30 experimental s^proaches. U is important to note that epigenetic dysfunction may exhibit 
stability during meiosis and therefore can be transmitted fi»m one generation to another 
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(Klar AJ. Propagating epigcnetic states Ihroueh meiosia: whete Mendel's gene is more 
than a DNA moiety. Trends Genet 1998; 14(8):299.301; Cavalli G. Paro R. The 
Drosopliila Fab-7 chromosomal element conveys epigenetic inheritance during mitosis 
and meiosis. Cell 1998; 93(4):505-18; Allen ND. Norris ML. Surani MA. Epigeneric 
5 control of transgene expression and imprinting by genotype-specific modifiers. Cell 1990 
Jun 1 ;6l(5):853.6l ; Silva AJ, While R. Inheritance of allelic blueprims for meihylation 
patterns. Cell 1988 Jul 15;54t2):145-52; Morgan HD, Sutherland HG. Manln Dl, and 
Wbitelaw E (1999) Epigenetic inheritance at the agouti locus in the mouse. Nature 
Genetics 23: 314-8), which would simulate filial. i.e. genetic, cases of the disease. 

10 

The above description is not intended to limit die claimed invention in any 
manner, Fordiennore, the di»:ussed combination of features might not be absolutely 
necessaiy fiar the mventive solution. 

IS 

The present invention will be further illustrated in the follov/ing examples. 
However, it is to be understood that these examples are for illustrative purposed only, and 
should not be used to limit the scope of the present invention in any manner. 

20 Examples 

Example I: Detection of epigenetic abnormaUties associated with schizophrenia or 
bipolar disorder. 

25 Identification of the actual genes, which are epigenetically dysregulated and increase the 
risk to m^jor psychosis, is not a simple task. Potentially any of die 3S.O0O human genes 
can be an epigenetic candidate for schiaophrenia and bipolar diwrder Based on our 
preliminary findings, as described below, we suggest ttiai epigenetic analysis of 
multicopy DNA sequences may lead to the identification of the genes that predispose to 

30 major psychosis. At least 35% of the human genome consists of numerous copies of 
differem transposons dispersed in the genome (NB: only --5% of the human genome ate 
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exons» i.e. eoding sequences of ftincf ional genes) (Yoder JA, Walsh CP, Bestor TH. 
Cytosine mediylatioii and the ecology of intragenomic parasites. Trends Genetics, 
13(8):335-40. 1997) . The range of copies of repetitive DNA fragments varies widely: 
There aie 106 copies of Alu sequences and 105 copies LI elements per genome (ibid.)- 

5 The general opinion is that such sequences represent excess baggage of our evolutionary 
heritage and do not perform any specific genomic function. This fraction of the genome 
is sometimes called "junk" or "parasitic" DNA. Such elements are not generally harmful 
to a cell as long as they do not exhibit any transcriptional activity and do not affect the 
integrity of the host genome. Transcriptional inactivation of the multicopy elements is 

10 achieved by their cpigcnetic modificaUon- It has been widely observed that DNA 
metbylation plays a role in silencing various types of DNA sequences. Since it is 
becoming evidenr that DNA methylation may act in concert with bistone ace^y lation (Nan 
X, Campoy FJ, Bird A. MeCF2 is a transcriptional repressor with abundant binding ates 
in genomic chromatin. CeU, 88(4):47l-8l. 1997X chromatin conformation can also be 

15 considered a fecior that plays a role in the inactivation of renotransposons as well as any 
oflter newly integrated DNA sequence. The findings fliat Alu and LI elemems as well as 
numerous other retroclements arc methylated and transcriptionally inactive in the 
genomes of ftongi, plants, and mammalsprovided the basis for postulating that epigeneiic 
DNA modification represents a host genome defense system (Bestor TH. DNA 

20 metbyltransfcrase in genome defence, tn: Epigenetic mechanisms of gene regulation. Eds: 
Russo VEA, Martienssen RA, Riggs AP, Cold Spring Harbor Laboratory Press, pp. 
61-76, 1996; Yoder JA. Walsh CP,Besior TH. Cytosine methylation and the ecology of 
intragwomic parasites. Trends Genetics, l3(8):335-40, 1997). 

25 The epigenetic parameter may add a new dimension to the already available 
developments inpsychiatric research. In ourexperiments we serencfipitously detected that 
while the overwhelming m^ority of Alu sequences in the genomic DNA extracted from 
human brain are methylated, ^ small fraction of such sequences is unmethylated. The 
ori^n of such selective Alu demethjiation is not clear. Without wishing to be bound by 

30 theory, this most likely represents a local ftllure of the qiigeneiic host defense system, 
which has no direct impact to the normal fimctioning of the brain. On die oOicr hand. 
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such locri epigeneiic changps may nM be limited lo the Alu sequences and may extend 
to the sunoimding genes, causing dysrcgHlation which may be detrimental to the cells. 
SuppottingevidenceferihiBCcmiesfiomtheobservaiionUwiteiroel^ 
demetbylated because they ate located in the genomic region that ^as subjected to 
5 genetic and epigenetic re-organization, to malignant cells, it was detected that some Alu 
( Rubin CM. VandeVoort CA. Teplite RL. Schmid CW . Alu repeated DNAs are 
differentially methylated inprimate germ cells. Nucleic Acids Research. 22(23):Sl2l-7, 
1994; Sinnen D. Richer C. Deragon JM. Lahuda D. Alu RNA transcripts in human 
embiionalcaicinom8cells.Modclofpost-lranscriptionalselecti«mofniaslcrse«p^ 

10 Journal of Molecular Biology. 226(3):689-706. 1992) »id LI (Ftorl AR. Ftemto KH. 
Niederacher D. Gerharz CD. Seifen WU Scbulz WA. DNA methylation and the 
mechanisms of CDKN2A inacrivaiion in tiansitional cell carcinoma of the urinaiy 
bladder.Laboraloryfcvcsligation,80(l0):1513-22.2000:JurgensB,Schmitz.Dn«e^ 

Schulz WA. Hypomeihylation of U UNE sequences prevailing in human moftelial 
1$ cardnoma.CancerReseamh.S<K24):569g.703.l99fi)elememsbecamehypometbyhit^ 

and transcriptionally active. 

Our working hypothesis ftr the experiments performed thus far is that identification of 
immelhytated "junkr DNA sequences in major psychosis may allow for the mapping of 
20 specificeeaomicregionsinwhichepigeneticre-arrangeraentsoccuired.Dysfoncrionof 

gtmes that are localized in such regions may be the actual cause of psychotic symptoms, 
while the demcthylated multicopy element sequence would serve as a reporter, a signal 
that allows for localizationof epigenetic cbariges in the genome. Based on the above, we 

investigated hypomeihylaied Alu elements in major psychosis. 

B£sill^. DNA samples were extracted Jirom the ftontal cortex of 40 post-roonem bram 
rtssuesofindividuals who were aflteted with schizophrenia and bipolar disorder as w^^^ 
asconiiol individuate- Inoider to avoidartifecBielatedtoparUalbrainDNAdcgnuJati^^ 
(which may simulate bypomethytarion and pmduce anifiM:tual Alu amplification; sec 
30 below), the IbUowing procedure was performed- Undigested io«l genomic DNA was 
fcM:tionatedonanagarosegel.thehi^molecuU.rweighi(>l5.2racb)DNAwascm 



JUN^Oe'-ZQQZ 14:01 



FROtit^^^s Fax 3 



-27- 

the gel. The gel block, contaitiing DNA, was treated with a gel digesting en2yme. agarase. 
Without any additional procedures, such high quality DNA samples can be funher 
digested with a specific xiesnriction enzyme and sabjecied to fusther analyses. The 
methylaiion sensitive restriction enzyme, HpaU, was used for digestion of DNA and the 
S unmethylated fraction of biain specific DNA (fragments smaller than axbitrarily selected 
6kb) were separated from the methylated fraction of DNA using gel electrophoresis. The 
<6kb fragments were puiified from the gel using glass mslfc. Screening for the presence 
of Alu's in the purified unmeths^ated DNA was performed using PGR and primers 
complemmtaiy to the Alu sequence. AIu aroplicons were cloned into a vector and 

10 iransfisrmed imo B.coU XL 1 -blue;. Up lo ten recombinant clones from each PCR product 
were sequenced from six individuals affected with major psychosis and four controls. The 
location of such Alu sequences were identified using human genome databases 
Oiap:/^genome.ucsc.edu/). It was detected that the AIu's from affected individuals in 
numerous cases corresponded with the genomic regions that showed evidence for linkage 

15 in genetic linkage studies of major psychosis. For example^ one of the Alu sequences 
cloned from an affected individual mapped to chr lq21, the region that was linked to 
schizophrenia (lod score of 6,S» the strongest evidence for linkage m schizophrenia 
genetics thus far) in large multiplex schizophrenia families (Brzustowicz LfA. et a].,, 
2000). In addition, an Alu clcme from another psychosis patient exhibited sequence 

20 homology widt lq42, die translocation region in a sdnzopfarenia kindred (St Clair D» et 
al. 1990). Odier genomic regions where Alu sequences mapped to the linkage 'qiots', 
include 5qll (although Unkage to this region [Sheningron R. et aI.1988] was not 
leplicated in other smdies. two large Idndreds exhibit lod scores between 2 and 3 in favor 
of linkage). Other identified regions include: Sq3S (chr S data reviewed in Crowe RR» et 

25 al. 1999), 8p23 (lod score 3.8 in a large Swedish schizophrenia kindred), 8p2U 10pl4, 
thepericentromctric regions of chr 10 and 10q26 (WiIdenauerDB» et. al. 1999), I IpIS 
and 1 lql3, 14q32 (Craddock 1999), I2pl3 and 12q23-24 (Detera-Wadleigh SD. et al. 
1 999), and 22q 1 3 (Numberger Jl Jr. et al. 1999). The 22ql 3 region exhibited evidence for 
linkage in numerous studies and harbors a deletion region in velo-cardiofacial syndrome, 

30 a disorder quite often resuhing in psychotic symptoms (Chow EW. et al. 1 994). For more 
details on the localization of the cloned Alu sequences see Figure l« Alu sequences that 
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aie located in the vicinity (within 100.000 hp) of ccMfine 8en« "« «» '^V^ ^• 
Sequences of the eloAeil Alu's are piovided in Figure 3. 

The above results aie of Interest for the foDowine reasons. First, clustering of Ae Ato 
5 sequences into the groups of affected individuals and controls, if replicated in an 
indqtendent sample, would indicate that qiigenetic changes of repetitive DNA elements 
in some genomic loci are specific to m^jor psychosis. TTjis would be a significant step 
ibnvaid in the li^i of the myriad of non-specific molecular changes in the brains of 
patients affected with major psychosis. Second, genomic location of the hypomethylaied 
10 Ala's match with the loci that exhibit evidence for linkage to major psychosis. Traditional 
genetic linkage studies face m^jor difficulties in fine mapping of the regions of 
susceptibility and identification of the acnial gene dysfimction that leads to majwr 
psychosis. Typically the regions that exhibit evidence for linkage to major psgrchosis are 
in the range of -10-40 cM, te. -10^0 million nucleotides fniaker GK, et aj., 2001; 
15 Tsuang IWfT. ei al 2001; BiayNI. and Owen MI. 2001: Gershon ES. 2000; Numbetger 
HJr. el al. 2000). and such regions contain hundreds of gates. Screemng of such a lar^ 
number of gpnes hy traditional strategies for the detection of DMA variation is not 
posable. For fine mapping of prediposing genes using the tiansmi^on disequilibrium 
test, vay large samples are required; this strategy has not been ptodttctive in psydiiacric 
20 research thus for. In conclusion, the "junk" DNA-based search for major psychosis genes 
may lepresem a valuable 'shortcut' in the identification of such genes. Hyporoediylated 
Alu's may piiqwim very specific sites of genomic DNA qiigenetic dysfimction of which 
may cause major psychosis. 

25 Example2: Idenrificaiion of genes involved in etiology of schizophrenia or bipolar 
disorder based on epigenetic analysis 

The genes *at are located in the regions exhibiting both linkage to miuor psychosis and 
epigenetic abnormalities in Alu sequences will be subjected to a detailed analysis. Using 
30 dieCeleraHumanGenomeDatabasewe willroakealistofgeaesfiom Iq21.5qll.8p23, 
I0pl4. Ilpl5, 12pl3. 12q23-24. 22ql3. chr Y, and several olher loci to be fimher 
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invesiigated from the epigeneric point of view. It is expected that such list wiU include 
^30 gmes. We will attempt to match patients and controls for age. sex, and race. Cases 
with drug and alcohol abuse will not be used in the study. Treatment with neuroleptic 
medications is also a significant confounding factor. Neuroleptic naSVe schizophrenic 

S patients are very r^, but cases with long neuroleptic free pre-moitem intervals are quite 
common. For example, in a recent study, one third of brain samples were neurolept ic-free 
for more than 6 months (Hernandez I» et al.» 2000) and during this period, -50% of 
schizophrenia patients are expected to relapse (Vigyera AC, et al., 1997) . If our 
hypothesis of epigeneiic dysregulation in schizophrenia and bipolar disorder is correct, 

10 disease associated q>igenetic abnamuUilies in the brain should recur after ncuiolepctc 
treatment is stopped. Regardmg the sample size, since thm are no precedents of 
epigenetic studies in major psychosis, power analysis on the sample size is not possible. 
Wears starting with a relatively large sample by post»monem brain study standards. Our 
plan is to investigate the prefrontal cortex from 25 post-mortem patients affected with 

15 major psychosis with >6 months of neuroleptic free period before death and a similar 
number of controls. Over 70 brain samples from individuals who were affected with 
schizophrenia or bipolar disorder as well as controls are available at our laboratory and 
this sample increases every year. Total mRNA from the brain tissues will be extracted 
using standard BNA exnraction techniques (Chomczynski P.et al., 1987) and subjected 

20 to reverse transcription and quantitative PCR cunplification using the Bio-Rad Real Time 
PCR equipment (hnp://www.bio~rad.coni/iCyclcr/). This experiment will allow for the 
quantitative evaluation of the steady state level of the of the candidate gene. *Is it 
P-actin* inRNA will serve as an internal standard for the degree of mRNA degradation. 
Expression of Is it p-actin is independent of the age of an individual and treatment 

25 (Schramm M» et al, 1999) and therefore can be reliably used in our experiment as an 
estimate of the degree of post-mortem degradation. Steady state mRNA level of each 
individual gene will be normalised according to its Is it P-actin mRNA data. The null 
hypothesis is that the group of affected individuals exhibits no differences in the steady 
state mRNA levels of the selected genes in comparison to the group of controls. The 

30 genes that reject the null hypothesis, i.e. the ones that exhibit statistically significant 
differences in steady state mRNA levels in affected tissues versus controls, will be 
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subjected to fimfaer analysis. The problm is that not all genes that exhibit significant 
diffisiences in expression may cany epigenetic defects. Cases when dion^ in steady 
state mRNA levels that may occur wifliin hoius or even minmes after some triggers are 
^plied, in the absence in any epigenetic changes in the genome have to be excluded. 

5 Typically, epigenetic DNA modification targets cyto^es in CpG dinucleotides, each of 
v^ich can be either methylated (melC) or unmethylated (C). The gold standard technique 
for DNA methylaiion analysis is based on the reaction of genomic DNA with sodium 
bisulfite under conditions such that cytosine is deaminated to uracil but metC remains 
unreacted (Fiommer M, et al. 1992), Sequencing of bisulfite modified DNA reveals 

10 which cytosines were methylated and which cytosines were not. This approach has been 
fiilly operationalized in our kibonitoiy (Popendikyte V. et al.» 1999). The project can be 
treated as successfiil if a g&ac fcom the list of -30 candidates exhibit disease specific 
q)igenetic abnormally. 

IS llie preset invention has been described with ns^rd to prefened embodiments. 
However, it will be obvious to persons dciUed in the an that a number of variations and 
modifications can be made without departing from the scope of the invention as 
described herein. 
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All references are herein incorporated by reference. 
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c) fractionation ot s»a pw» 
fragments of adesitcd size; „^^tof said DNA fragments of a desired size 

f>sequeacedet«tntaatio«ofs«dPCRprod«ci 

^^:r:nnaii.associ.edwi.anon.M.^^^^^^ 

• -idnon-MendeBandiseaseissdeciedftomihegroup 
2.Themed.odofclaiml.whercmBa.dnouM^ 
consistingofscluzophrenia.bi,olard«o«ier.c«.cer.andd, 

u • ™d sample that exhibits cbaxacterisiics of a noo- 
3. The meAod of claim I. wherein said sample tna 
Meodelian disease is brain tissue. 

thai exhibits characteristics of a non- 

Mendclian disease is selected fioia the group 



cortex. 



^ . ^sired size is less than 10 kb. 
5. The method of claim I. whcrem satd desired 
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6. The method of claim I, wherein said endogenous DNA elcnusnt is a mttlti-copy DNA 
element. 

7. The meOiod of claim 6, wherein said muIli*copy DNA element is selected ftom the 
group consisting of LIME, Sn4B, LI, and AIu.. 

8. The method of claim l» wherein said merbylation-senaitive restriction enzyme is 
selected fiom the group consisting of AaiH (GACGTC); Bsbl236I (CGCG); Bshl2SSI 
(CGRYCG); BshTI (ACCOGT); Bsp68I fTCGCGA); Bspl 191 (TTCGAA); Bspl43n 
(RCCGCY); BsulSI (ATCGAT); CfrlOI (RCCGGY); Cfr42I (CCGCGG); Cpol 
(CGGWCCG);Eco47ni(AGCGCT);Eco52I(CGGCCG);Eco72l(CACGTG);Ecol05I 
(TACGTA); Ehel (GGCGCC); Esp3I (CGTCTC); FspAI (RTGCGCAY); Hinll 
(GRCGYC); Hin6I(GCGC); Hpan(CCGG); Kpn2I (TCCGGA); MluI(ACGCGT);Notl 
(GCGGCCGC); Nsbl (TGCGCA); Paul (GCGCGC); Pdil (GCCGQC); Pa23n 
(CGTACG); Pspl406I (AACGTT); Pvul (CGATCO); Satt (QTCGAC); Smal 
(CCCGGG); Smul (CCCGC); Tail (ACGT); and TauX (GCSGC). 



9. Method of identifying a gene having an epigeneticaily altered expression pattern that 
contributes to a non-Mendeltan disease in an oiganism» said method comprifung: 

a) extraction of graomic DNA fiom a sample that exhibits characteristics of a 
non-Mendelian disease; 

b) digestionofsaidgenomicDNAwidiajnethylatioi^seitsitiverestrictionen2^ 
to produce a pool of restricted DNA fiagments; 

c) fiaciionation of said pool of restricted DNA firagments to obtain DNA 
fiagments of a desired size; 

d) amplification of at least a segment of said DNA fiagments of a desired size 
with primers that anneal to an endogenous DNA element to produce a PCR product; 

c) cloning of said PCR product into a sequencing vector; 
Q sequence determination of said PCR product to obtain a sequence of said PGR 
product; 
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g) comparing said sequence agaipsi a genomic database to assign a locus for said 
epigeneric abm»malicy associated widi a non-Mendelian disease; 

h) searching said database to idendfy a gene located proximal to said locus; 

i) comparing expression patterns of said gene located proximal to said locus 
witbin a test sample that exhibits characteristics of said non-Mendelian disease with 
expiessioit patterns of a coiresponding gene within a conixol sample lo idemigr said g«e 
having an epigeneiically altered expression partem. 

10. A getie isolated by the method of claim 9. 

11. Method of isolating a probe for detecting an epigenfitic abnormaiiiy associated widi 
a non-Mendelian disease, said method comprising: 

a) extraction of genomic DNA from a sample that exhibits characteristics of a 
non-Mendelian disease; 

b) digestionof said genomic DNA with amedtylation-sensitiveresniction en^me 
to produce a pool of restricted DNA fragments; 

c) fracticmation of said pool of testricied DNA fragments to obtain DNA 
fragments of a desired size; 

d) ampUficarion of at least a segmotxt of s»d DNA fiagmems of a desired size 
widi primers that atmeal to an endogenous DNA element to produce a FCR product; 

e) using said PCR product as said probe to detect said epigeneuc abnomtality 
associated with a non-Mendelian disease in anodier santple. 

12. A }unbe isolated by the method of claim 11 . 

13. Themethodofofclaim 1 wherein said detecting an epigenetic abnormality associated 
with a complex non--Mendelian disease, is used to diagnose an epigenetic abndimality 
associated with a complex non-^Mendelian disease. 
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Tlie invention can be summarized as follows. A method comprising extraction 
of genomic DNA from a diseased tissue or diseased population of ceUs; hydrolysis of this 
ONA with methylation-sensitive restriction enzymes, and subsequent fractionation of 
UNA fragments and purification of DNA fragments of a desired size* PCR amplification 
using primers that hybridize to endogenous PNA elements including. PCR products of 
such elements are cloned and sequenced using standard molecular biology techniques 
Known to the filled artisan and the resultant sequences are mapped on the genome using 
any commercially or publicly available human genome database. These cloned 
endogenous ONA elements indicate a loci of putative epigenetic abnormality or 
epigenetic dys-regulation and indicates genes that predispose a patient to a complex, 
non-Mendelian, multi-factorial disease, such as, but not limited to, cancers, diabetes, 
schizophrenia, or bipolar disorder 
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1 Hypothetical protein FU23017, FU23017 
highly expressed in cancer, rich in leucme, HEC 
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1 LOC57 167. similar to SALLl (sal (Drosophila).like 1 
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OSM, oncostalin M (Ref?? 2 papers found oa bipolar WHAT??) 
LTF, leukemia inhibitor}' factor (cholinergic 
EPI64, EBP50-PDZiflterach)cof64kD 
SF3AL splicing factor 3a, subuoit I. l20kD 
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BRE, brain and reproductive organ-expressed (TWFRSFIA 
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SHC3« neuronal She j 


FOLRL folate receptor I precursor 

SKD3, suppressor ofpotassium transport defect 3 

fNPPLU inositol polyphosphate pbosphataso-like 1 

F0LR2» folate receptor 2 precursor . 

ARLX« aristaless(Oro5ophila) homeobox 


nuclear protein, ataxia-telangiectasia locus, NPAT {Lange, 1989 #1 l4;Weekd. 
1989 #115} 


CNGB 1 » cyclic nuclcof rde gated channel beta t 
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KCNNI» potassium intermediate/small conductance (REF 7? I ileras Foundon 
Schizophrenia. 2 Items found on bipolar. 
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